Model Selection

128K long context

# 128K long context

Instella 3B Long Instruct

Instella-Long is an open-source language model with 3B parameters developed by AMD, supporting a context length of 128K and performing excellently in long-context benchmark tests.

Large Language Model

Gemma 3 1b It Qat Bnb 4bit

Gemma 3 is a lightweight open model series launched by Google, built on Gemini technology, supporting multimodal input and text output.

Phi 4 Mini Reasoning GGUF

Phi-4-mini-reasoning is a lightweight open model built on synthetic data, focusing on high-quality, reasoning-rich data, and further fine-tuned for more advanced mathematical reasoning capabilities.

Large Language Model

Openhands Lm 7b V0.1 GGUF

OpenHands LM is an open-source coding model built on Qwen Coder 2.5 Instruct 32B, which performs excellently in software engineering tasks through special fine-tuning.

Large Language Model English

Nu2 Lupi Qwen 14B

Nu2-Lupi-Qwen-14B is a mathematical reasoning optimized model based on the Qwen 2.5 14B architecture, excelling in complex problem-solving and logical deduction.

Large Language Model

Gemma 3 27b It Qat Unsloth Bnb 4bit

Gemma 3 is a lightweight, state-of-the-art multimodal open-source model launched by Google, capable of processing text and image inputs and generating text outputs.

Gemma 3 4b It Qat Unsloth Bnb 4bit

Gemma 3 is a lightweight, cutting-edge open model series launched by Google, built on Gemini model technology, supporting multimodal input and text output.

Gemma 3 4b It Qat GGUF

Gemma 3 is a lightweight, advanced open model series from Google, built on the same research and technology used to create Gemini models. This model is multimodal, capable of processing both text and image inputs to generate text outputs.

Text-to-Image English

Gemma 3 27b It Qat

Gemma is a lightweight open model series launched by Google, built on Gemini model technology. Gemma 3 is a multimodal model supporting text and image inputs with text outputs, featuring a 128K large context window and multilingual capabilities.

Gemma 3 12b It Qat Unsloth Bnb 4bit

Gemma 3 is a lightweight and state-of-the-art open model family launched by Google, built on the same research and technology as the Gemini model. It supports multimodal input and text output.

Gemma 3 12b It Qat GGUF

Gemma is a lightweight, advanced open model series from Google, built using the technology behind the Gemini models. Gemma 3 is a multimodal model capable of processing both text and image inputs to generate text outputs.

Gemma 3 12b It Qat

Gemma 3 is a lightweight, state-of-the-art multimodal open-source model launched by Google. It can process text and image inputs and generate text outputs, suitable for various text generation and image understanding tasks.

Synthia S1 27b Bnb 4bit

Synthia-S1-27b is an advanced reasoning AI model developed by Tesslate AI, focusing on logical reasoning, coding, and role-playing tasks.

Gemma 3 27b It Qat Q4 0 Unquantized

Gemma 3 is a lightweight and advanced multimodal open model launched by Google. It is built on the same research and technology as the Gemini model, supporting text and image inputs and generating text outputs.

Gemma 3 12b It Qat Int4 Unquantized

Gemma 3 is a lightweight multimodal open model from Google, supporting text and image inputs with text output, featuring a 128K large context window and multilingual capabilities.

Gemma 3 4b It Qat Int4 Unquantized

Gemma 3 is a lightweight multimodal open model launched by Google, supporting text and image input and generating text output. The 4B version has undergone instruction tuning and quantization-aware training, making it suitable for deployment in resource-constrained environments.

Gemma 3 1b It Qat Int4 Unquantized

Gemma is Google's lightweight advanced open model series, built with the same technology as Gemini, supporting multimodal input and text generation.

Large Language Model

Gemma 3 27b It Qat Compressed Tensors

Gemma 3 is a lightweight and advanced open model series launched by Google, built on the same research and technology as the Gemini model. This version is an instruction-tuned model with 27B parameters, using quantization-aware training (QAT) and compressed tensor technology.

Gemma 3 12b It Qat Compressed Tensors

Gemma 3 is Google's lightweight cutting-edge open model family, built on the same research and technology used to create Gemini models. This model is multimodal, capable of processing both text and image inputs to generate text outputs.

Gemma 3 4b It Qat Q4 0 Unquantized

Gemma 3 is a lightweight open-source multimodal model introduced by Google, built on the same technology as Gemini, supporting text and image inputs to generate text outputs.

Gemma 3 12b It Qat Q4 0 GGUF

Gemma is a lightweight, cutting-edge open model series from Google, built on Gemini technology. The 12B version is a multimodal model supporting text and image input, featuring a 128K large context window and support for over 140 languages.

Gemma 3 4b It Qat Autoawq

Gemma 3 is a lightweight open-source multimodal model launched by Google, built on Gemini technology, supporting text and image input and generating text output.

Gemma 3 4b It Speech

Gemma-3-MM is a multimodal instruction model extended from Gemma-3-4b-it with added speech processing capabilities, capable of handling text, image, and audio inputs to generate text outputs.

Gemma 3 27b It Int4 Awq

Gemma is a lightweight and advanced open model series launched by Google, built on the same research and technology as Gemini. The 27B version is a multimodal model that supports text and image input and generates text output.

Gemma 3 27b Pt Qat Q4 0 Gguf

Gemma is a lightweight and cutting-edge open model family launched by Google, built on the same research and technology as the Gemini model. Gemma 3 is a multimodal model that can process text and image inputs and generate text outputs.

Gemma 3 27b It Qat Q4 0 Gguf

Gemma is a lightweight open-source multimodal model series launched by Google. It supports text and image inputs and generates text outputs. It has a 128K large context window and supports over 140 languages.

Gemma 3 4b It Int4 Awq

Gemma is a lightweight, advanced open model series from Google, built using the same research technology as Gemini. Gemma 3 is a multimodal model capable of processing both text and image inputs to generate text outputs.

Gemma 3 27b Pt Bnb 4bit

Gemma 3 is a lightweight open model series launched by Google, built on the same research and technology as the Gemini model, supporting multimodal input and text output.

Transformers English

Gemma 3 1b Pt Unsloth Bnb 4bit

Gemma 3 is a series of lightweight open models launched by Google, supporting multimodal input (text and images), with a 128K large context window, suitable for various tasks such as question answering and summarization.

Transformers English

Gemma 3 4b It Qat Q4 0 Gguf

Gemma 3 is Google's lightweight cutting-edge open-source multimodal model supporting text and image inputs with text output, featuring 128K context window and 140+ language support

Gemma 3 1b It Qat Q4 0 Gguf

Gemma is Google's lightweight cutting-edge open model series, built using the same research technology as Gemini. The 1B version is instruction-tuned, suitable for deployment in resource-constrained environments.

Gemma 3 is a lightweight advanced open model series launched by Google, built on the same research and technology as the Gemini models. This model is multimodal, capable of processing both text and image inputs to generate text outputs.

Gemma is a lightweight open-source multimodal model series launched by Google, built on the same technology as Gemini, supporting text and image inputs and generating text outputs.

Phi 4 Multimodal Instruct

Phi-4-multimodal-instruct is a lightweight open-source multimodal foundation model that integrates language, vision, and speech research and datasets from Phi-3.5 and 4.0 models. It supports text, image, and audio inputs to generate text outputs, with a context length of 128K tokens.

Multimodal Fusion

Transformers Supports Multiple Languages

C4ai Command R7b Arabic 02 2025

A 7B-parameter large language model optimized for Arabic, supporting 128K context length with excellent performance in enterprise-level tasks

Large Language Model

Transformers Supports Multiple Languages

Phi 4 Multimodal Instruct Onnx

ONNX version of the Phi-4 multimodal model, quantized to int4 precision with accelerated inference via ONNX Runtime, supporting text, image, and audio inputs.

Multimodal Fusion Other

Spec-Vision-V1 is a lightweight, state-of-the-art open-source multimodal model designed for deep integration of visual and textual data, supporting a 128K context length.

Transformers Other

SVECTOR-CORPORATION

Chocolatine 2 14B Instruct V2.0.3

Chocolatine-2-14B-Instruct-v2.0.3 is a large language model based on the Qwen-2.5-14B architecture, fine-tuned with DPO, specializing in French and English tasks, and excels in the French LLM leaderboard.

Large Language Model

Transformers Supports Multiple Languages

C4ai Command R7b 12 2024

Command R7B is an open-weight 7B parameter research version model, optimized for diverse scenarios such as reasoning, summarization, Q&A, and coding, supporting 23 languages.

Large Language Model

Transformers Supports Multiple Languages

Phi-3.5 is an advanced large language model developed by Microsoft based on the Phi-3 architecture, focusing on high-quality, reasoning-rich data and supporting a context length of 128K tokens.

Large Language Model

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase